K | # of bigrams | # of trigrams | # of 4-grams | # of 5-grams | # of 6-grams |
---|---|---|---|---|---|
100 | 54 | 94 | 94 | 98 | 98 |
1000 | 249 | 526 | 783 | 911 | 962 |
10000 | 638 | 2190 | 4548 | 6723 | 8119 |
100000 | 1883 | 7875 | 19339 | 32704 | 43644 |
1000000 | 1883 | 7875 | 19339 | 32704 | 43644 |
Both the problem and the results are much similar to the previous subsection: We consider letter-N-grams at the end of words instead of the beginning.
3.8.1 Number of letter-N-grams at word beginnings